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Rule 41 ■37(c)(1)(i) Real party of interest 

Texas Instruments Incorporated owns the application. 

Rule 41 ■37(c)(1)(li) Related appeals and interferences 

There are no related dispositive appeals or interferences. 

Rule 41 .37(c)(1 )(iii) Status of claims 

Claims 1-8 are pending in the application with all claims finally rejected. 
This appeal involves the finally rejected claims. 

Rule 41 .37(c)(1 )(lv) Status of amendments 

There is no amendment after final rejection. 

Rule 41 .37(c)(1)(v) Summary of claimed subiect matter 

The claimed subject matter relates to playout (scheduling and decoding) 
of packetized frames of digital speech when packets arrive late: claim 1 relates to 
playout methods which truncate frames (used to reduce a too-large playout 
delay), claim 4 relates to playout methods which expand frames (used to fill in 
when playout delay jumps), and claims 7-8 relate to receivers capable of playout 
including (decoded) frame truncation and expansion as in claims 1 and 4. In 
general, frames of digital speech may be encoded (to reduce the number of bits 
required) and transmitted over networks in the form of packets analogous to data 
transmission (e.g., VoIP over the Internet), but transmission delays may cause 
packets to arrive late. In this case, playout methods attempt to manage the time 
gap caused by the late frames; Fig.1 illustrates varying arrival times and the 
expansion of frame m by repeated portions T^""' until frame m+1 is available. 
After the late arrivals, the playout methods attempt to reduce the increased 
playout delay back towards a target playout delay by shortening the length of 
some frames. 

More particularly, for claim 1 , application page 1 1 , last paragraph and 
page 1 2, middle paragraph describe the shortening of frames by truncation of 
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silence frames (e.g., a 20 millisecond frame shortened to 4 milliseconds by 
cutting off the last 16 milliseconds), but not truncating active speech frames. 

For claim 4, application page 12, last paragraph to page 13, paragraph (1) 
describe the expansion of voiced frames by integer multiples of the pitch as 
illustrated in Fig.1 where frame m is expanded with three repeats of a pitch 
length portion labeled T^'"^ 

For claim 7, Fig. 5 illustrates the functional blocks of a receiver which 
provides playout frame expansion as in claim 4. 

Rule 41 ■37(c)(1)(vi) Grounds of rejection to be reviewed on appeal 
The grounds of rejection to be reviewed on appeal are: 

(1) Claims 1-6 were rejected as anticipated by the Gersho reference. 

(2) Claims 7-8 were rejected as unpatentable over the Gersho reference in 
view of the Maeda reference. 

Rule 41 ■37(c)(1)(vii) Arguments 

(1) Claims 1-6 were rejected as anticipated by Gersho. Claims 1 and 4 
are the independent claims and are argued separately. 

With regard to claim 1 , clause (a), the Examiner asserted that encoding 
meets the limitation "truncation" and cited column 5, lines 49-51 , column 19, lines 
44-50, column 8, lines 23-27 plus blocks 34, 36, and 42 of Fig. 8 for showing 
deferred; and with regard to claim 1 , clause (b), the Examiner cited column 14, 
lines 40-42 and column 8, lines 23-27. 

Appellants reply as follows: 

(i) Preliminarily, claim 1 claims playout (e.g., scheduling and decoding) of 
packetized frames, whereas Gersho Fig.8 is an encoder (column 14, lines 52-54) 
and is essentially irrelevant. The more pertinent part of Gersho is the decoder of 
Fig.14. 

(ii) Gersho column 5, lines 49-51 is the entry in the Brief Description of the 
Drawings for Fig.8 and simply states that Fig.8 is a block diagram of a coder and 
nothing further. There is no suggestion of frame truncation. 
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(iii) Gersho column 19, lines 44-50 describes the choice of a window 
length of either 16 or 24 samples, depending upon the pitch in the frame. As 
explanation, column 19, lines 52-55 state a window is defined around each 
epoch, and column 19, lines 6-7 state epochs represent centers of windows in 
the frame. And column 7, lines 34-50 state that a window is an active interval 
with most of the energy of the excitation; that is, a window is within an active 
frame. There is no suggestion of frame truncation. 

(iv) Gersho column 8, lines 23-27 state a subframe boundary is moved if a 
window crosses the boundary. As explanation, Fig.2 and column 7, lines 18-27 
state a frame is partitioned into M subframes of nearly equal length. Again, no 
suggestion of frame truncation. 

(v) Gersho Fig. 8 blocks 36, 34, and 42 do the following: block 36 checks a 
frame of 160 samples for speech activity or silence (column 15, lines 25-26); 
block 34 is the first step in classifying a frame with activity into voiced, unvoiced, 
or transition (column 16, lines 5-7); and block 42 encodes the excitation and 
synthesizes speech for the frame as shown in Fig. 9 (column (column 18, lines 
42-43). And Fig. 9 shows three encoders, one for each active frame class, but 
does not suggestion frame truncation. Indeed, the plain English meaning of 
"truncate" is "shorten by cutting off". Thus truncating a frame means discarding 
some of the samples of the frame, such as truncating a 20 ms frame with 1 60 
samples, x[0], x[1], x[159], to 4 ms would mean retaining samples x[0], x[1], 

x[31] and discarding samples x[32], x[33], x[159]. Gersho has no 
suggestion of frame truncation. The Examiner's assertion of coding as the same 
as truncating confuses two separate ideas: truncation of a frame versus coding 
with quantization of parameter values which may use truncation of least 
significant bits to lower the bit count. Truncation of a frame shortens the frame 
by discarding samples of the frame, whereas coding (as in Gersho) takes all of 
the samples of the frame and generates a set of parameters (e.g., LP filter 
coefficients, pitch, gain, excitation, etc.) with a small number of bits which can be 
transmitted at a low bit rate and then can be used to reconstruct an 
approximation of the all of the samples of the frame. But such truncation as part 
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of coding is not related to truncation of the frame whicfi discards samples. As an 
analogy, simply quantizing the data would lower the bit count; for example, the 
160 samples of a frame may have 16-bit integer format, and quantizing 
(truncating) each sample to 8-bit integer format by discarding the 8 least 
significant bits would retain all 160 samples, but each sample would have lower 
resolution. That is, truncation of each sample's least significant bits is not 
truncation of the frame which would discard samples to shorten the frame. 

(vi) Gersho column 14, lines 40-42 states that the first stage of the frame 
classifier (block 34 of Fig. 8) decides whether the frame is unvoiced or not. No 
suggestion of frame truncation. 

(vii) Gersho column 19, lines 44-50 was previously argued in (iii). 

(viii) Gersho column 8, lines 23-27 was previously argued in (iv). 

In short, Gersho does not suggest frame truncation, and consequently 
does not suggest independent claim 1 or its dependent claims 2-3. 

With regard to claim 4, clause (a), the Examiner cited Fig. 14 elements 
lOe/lOg, and with regard to claim 4, clause (b), the Examiner cited Fig. 14 
elements lOg-IOh, column 28, lines 46-53, column 15, lines 23-24, and column 
8, lines 23-27. 

Appellants reply as follows. 

(i) Gersho Fig. 14, elements lOe/lOg indicates construction of the 
excitation for either an unvoiced or a voiced frame; however, claim 4, clause (a) 
requires a classifying of the frame. But Gersho Fig. 14, element 10a does decode 
the classification of the incoming frame, so Fig.14 does indeed show the frame 
classifying of claim 4, clause (a). 

(ii) Gersho Fig.14, elements lOg-IOh construct the excitation for a voiced 
frame and decode the pitch for a voiced frame, respectively; neither suggests 
anything beyond the usual decoding of a GELP-encoded frame. In contrast, 
claim 4, clause (b) requires frame expansion (i.e., after decoding the frame, 
expanding the frame beyond its original length) in a playout method. There is no 
suggestion of frame expansion in Fig.14. 
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(iii) Gersho column 28, lines 42-55 describe decoding the pitch and 
constructing the excitation for a voiced frame as part of decoding the frame; 
however, there is no suggestion of frame expansion. 

(iv) Gersho column 15, lines 23-24 notes the bandwidth expansion 
correlation coefficient, Rw Bandwidth expansion relates to moving zeroes of the 
transfer function of the LP filter towards or away from the unit circle in the z- 
plane; this is unrelated to frame expansion. 

(v) Gersho column 8, lines 23-27 state as part of encoding a subframe 
boundary is moved if a window crosses the boundary. As explanation, Fig. 2 and 
column 7, lines 18-27 state a frame is partitioned into M subframes of nearly 
equal length. Again, no suggestion of frame expansion, especially of decoded 
frames being played out and which were classified as voiced and with the 
expansion being a repeat of pitch length portions. 

In short, Gersho does not suggest frame expansion, and consequently 
does not suggest independent claim 4 or its dependent claims 5-6. 

(2) Claims 7-8 were rejected as unpatentable over Gersho in view of 
Maeda. Claims 7-8 related to devices which implement methods of claims 1 -6, 
so the Examiner again pointed to Gersho column 28, lines 42-55, column 15, 
lines 34-34, and column 8, lines 23-27 for frame expansion and added Maeda for 
a playout scheduler. 

Appellants repeat the foregoing argument with regard to claim 4 that 
Gersho does not suggest frame expansion as required by independent claim 7. 

Consequently, the independent claims (claims 1 , 4, and 7) are patentable 
over the references, and thus all claims patentable over the references. 
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Rule 41 ■37(c)( 1 )(viii) Claims appendix 



1 . A method for playout of packetized speech, comprising: 

(a) deferring truncation of an active frame; and 

(b) truncating a silence frame. 

2. The method of claim 1 , wherein: 

(a) said packetized speech includes CELP-encoded frames; and 

(b) said truncating a silence frame includes truncating an excitation for said 
silence frame. 

3. The method of claim 1 , further comprising: 

(a) expanding an active frame according to a voicing classification for said 
active frame. 

4. A method of frame playout expansion, comprising: 

(a) classifying a frame as voiced or not; and 

(b) expanding a voiced frame by a multiple of the pitch of said voiced frame. 

5. The method of claim 4, wherein: 

(a) said frames are CELP-encoded frames; and 

(b) said expanding a voiced frame includes expanding an excitation for said 
voiced frame by a multiple of the pitch of said voiced frame.. 

6. The method of claim 4, wherein: 

(a) said classifying a frame of step (a) classifies an active frame as one of (i) 
voiced, (ii) unvoiced, or (ill) transition; and 

(b) expanding an unvoiced frame includes expanding an excitation for said 
unvoiced frame with a random fixed-codebook vector. 
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7. A receiver, comprising: 

(a) an input for receiving CELP-encoded frames; 

(b) a decoder coupled to said input; and 

(c) a playout sclieduler coupled to said input; 

(d) said decoder operable to provide expansion of a voiced frame in response 
to said playout scheduler, wlierein said expansion is a multiple of the pitch for 
said voiced frame. 

8. The receiver of claim 7, wherein: 

(a) said decoder operable to provide truncation of a frame in response to said 
playout scheduler only when said frame is a silence frame. 

Rule 41 .37(c)(1)(ix) Evidence appendix 
none 

Rule 41 .37(0^1 )(x) Related proceedings appendix 
none 



Page 8 of 8 



